CDS

Accession Number TCMCG075C23604
gbkey CDS
Protein Id XP_017981489.1
Location complement(join(3556528..3556797,3556926..3557052,3557243..3557359,3557651..3557807,3557898..3558039,3558261..3558351,3558661..3558767,3558869..3559024,3559121..3559294,3559418..3559559,3559738..3559857,3559952..3560071,3560308..3560367,3560486..3560607,3560741..3560782))
Gene LOC108663123
GeneID 108663123
Organism Theobroma cacao

Protein

Length 648aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018126000.1
Definition PREDICTED: probable rhamnogalacturonate lyase B [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category S
Description Rhamnogalacturonate lyase
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K18195        [VIEW IN KEGG]
EC 4.2.2.23        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGTCGCCTATTGGGGTTCAACTCCATATTCAGGATACTCATGTGATGATAGATAATGGCGTACTCCAACTCACATTATCGAACCCTGATGGAATTGTGACCGGCATTCGATATAATGACATTGACAACTTGCTGGAAGTTCTAAATGAAGAATCGAATAGAGGGTACTGGGACCTTGTGTGGAGCTCACCGGGTACTGCTGGAACTGCTGGCTTATTTGACGTGATTAAGGGAACAATTTTTAAGGTTATAGTGGAAAATGTGGATCAAGTTGAGGTTTCCTTCACAAGAACGTGGGATTCCTCCCAAGAGGGCAAGGTTGTTCCTCTAAATATAGACAAAAGGTTCATTGTGCTGCGTGGTTGCTCAGGCTTCTACTCTTATGCCATTTATGAGCACTTGAAAGATTGGCCTGGTTTTAACCTCGCTGAAACCAGAATCGCATTCAAGCTGAGAAAAGACAAGTTTCACTACATGGCTATGGCAGACAATAGGCAAAGATACATGCCTCTGCCTGATGACCGGTTGCCAGGAAGAGGTCAAGCCCTGGCTTACCCAGAAGCAGTCCTCCTGGTTGACCCCGTGGAGCACGAGTTGAAAGGAGAGGTCGATGACAAGTACCAATATTCATGTGATAACAAAGACAGCCAAGTTCACGGCTGGATACGCACTGACCCACCAGCAGTGGGATTCTGGATGATCACGCCCAGCAACGAGTTCCGCTCCGGTGGACCCGTCAAACAGAACTTAACCTCTCATGTGGGCCCCACAACCCTTGCTGTGTTTCTCAGTGCTCATTATTCTGGAGAAGACCTGGTGCCGAAATTCAGTGCCGGCGAGGCCTGGAAGAAAGTCTTTGGCCCTGTTTTTATCTATCTCAACTGTGCAATGGAAGGAGATGATCCGCTTTCGCTTTGGGAGGATGCTAAAGAACAGATGATTACTGAATTCCAGAGCTGGCCTTACACTTTCCCAGCTTCCGAGGATTTTCCTAAATCGGACCAACGGGGTAGTGTTAGCGGCAGACTTCTTGTGCATGACAGGTACGTTAGTGATGACAACATACCAGCAAATGGAGCTTACATAGGATTGGCTCCACCAGGGACTGCTGGGTCATGGCAAAGAGAATGCAAGGACTACCAATTTTGGACCCAAACAGACGTGAATGGCTATTTTTTGATCAATGACATACGAACTGCTGATTATAACCTTTACGCATGGGTTCCTGGTTTTATTGGAGATTATCGATCTGATGTTGCTATCACAATAACTCCAGGTAGTTATATTGAGGTGGGTGATCTCATTTATGAACCTCCGAGAAATGGACCTACATTGTGGGAAATAGGCATCCCTGATCGTTCTGCTGCAGAATTTTATGTCCCAGATCCTAATCCTAAGTACATCAACAAACTTTATGTTAACCATCCGGATCGGTTTAGACAGTATGGACTATGGGAAAGATATGCAGAACTGTATCCTGTTGGAGATCTAGTTTACACAGTTGGCAGTAGTGACTATAAAAAAGATTGGTTCTTTGCCCAAGTAACCAGGAAGACTGATAATAACAAGTATCAAGGAACAACATGGCAAATTAAATTCAAACTTGACAATGTGGATCAGAGTAATTCCTATAAATTACGCTTGGCGATTGCATCTGCAACTTTATCCGAATTGCAGATTCGAATTAATGATCCAAAAGGAAATCCTTTATTTTCAAGTGGACTATTCGGGAGGGACAACTCAATTGCAAGGCATGGAATTCATGGACTCTACTGGCTGTACAATGTAGATATACCTGGAAAACTGCTTGTACAAGGTGATAACACTATCTTCCTGACACAGCCACGAAGCAGCGGCCCGTTTCAAGGGATTATGTATGATTACATACGATTAGAAGGCCCCCCAACTTCAAGTTCCAAGAAAGAACATATGAGTGCATTGTCATAG
Protein:  
MSPIGVQLHIQDTHVMIDNGVLQLTLSNPDGIVTGIRYNDIDNLLEVLNEESNRGYWDLVWSSPGTAGTAGLFDVIKGTIFKVIVENVDQVEVSFTRTWDSSQEGKVVPLNIDKRFIVLRGCSGFYSYAIYEHLKDWPGFNLAETRIAFKLRKDKFHYMAMADNRQRYMPLPDDRLPGRGQALAYPEAVLLVDPVEHELKGEVDDKYQYSCDNKDSQVHGWIRTDPPAVGFWMITPSNEFRSGGPVKQNLTSHVGPTTLAVFLSAHYSGEDLVPKFSAGEAWKKVFGPVFIYLNCAMEGDDPLSLWEDAKEQMITEFQSWPYTFPASEDFPKSDQRGSVSGRLLVHDRYVSDDNIPANGAYIGLAPPGTAGSWQRECKDYQFWTQTDVNGYFLINDIRTADYNLYAWVPGFIGDYRSDVAITITPGSYIEVGDLIYEPPRNGPTLWEIGIPDRSAAEFYVPDPNPKYINKLYVNHPDRFRQYGLWERYAELYPVGDLVYTVGSSDYKKDWFFAQVTRKTDNNKYQGTTWQIKFKLDNVDQSNSYKLRLAIASATLSELQIRINDPKGNPLFSSGLFGRDNSIARHGIHGLYWLYNVDIPGKLLVQGDNTIFLTQPRSSGPFQGIMYDYIRLEGPPTSSSKKEHMSALS